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The Cambridge Structural Database ( CSD ) contains a 
large amount of molecular structure data ( bond length, 
bong angle and torsion angle data.) Much of this data 
has previously been extracted in histogram form and 
provided in the Mogul program. Histograms however 
have several disadvantages e.g. they are not smooth, 
they depend on bin widths and bin end points. 

Kernel density estimators do not bin data and have no 
end points but centre a kernel function at each data point 
and smooth kernel functions will generate smooth density 
estimates [1]. A difficulty of the approach though is how 
wide to make the kernel functions. 

In this work kernel density estimation is used to gener- 
ate probability density functions ( pdfs ) for bond length, 
bond angle and torsion angle histograms derived from the 
CSD. Gaussian kernels are used for bond length and bond 
angle data and a von Mises kernel is used for the torsion 
angle data [2]. The resulting pdfs are smooth and are sui- 
table for application to molecular geometry optimisation. 
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